智能论文笔记

POLCOVID: a multicenter multiclass chest X-ray database (Poland, 2020-2021)

Aleksandra Suwalska , Joanna Tobiasz , Wojciech Prazuch , Marek Socha , Pawel Foszner , Jerzy Jaroszewicz , Katarzyna Gruszczynska , Magdalena Sliwinska , Jerzy Walecki , Tadeusz Popiela

分类：计算机视觉

2022-11-29

The outbreak of the SARS-CoV-2 pandemic has put healthcare systems worldwide to their limits, resulting in increased waiting time for diagnosis and required medical assistance. With chest radiographs (CXR) being one of the most common COVID-19 diagnosis methods, many artificial intelligence tools for image-based COVID-19 detection have been developed, often trained on a small number of images from COVID-19-positive patients. Thus, the need for high-quality and well-annotated CXR image databases increased. This paper introduces POLCOVID dataset, containing chest X-ray (CXR) images of patients with COVID-19 or other-type pneumonia, and healthy individuals gathered from 15 Polish hospitals. The original radiographs are accompanied by the preprocessed images limited to the lung area and the corresponding lung masks obtained with the segmentation model. Moreover, the manually created lung masks are provided for a part of POLCOVID dataset and the other four publicly available CXR image collections. POLCOVID dataset can help in pneumonia or COVID-19 diagnosis, while the set of matched images and lung masks may serve for the development of lung segmentation solutions.

translated by 谷歌翻译

Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

Christoph Bergmeir , Frits de Nijs , Abishek Sriramulu , Mahdi Abolghasemi , Richard Bean , John Betts , Quang Bui , Nam Trong Dinh , Nils Einecke , Rasul Esmaeilbeigi

分类：人工智能

2022-12-21

Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively few research has been done in this area. This paper presents the findings of the ``IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim to foster and facilitate research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that lead to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method.

translated by 谷歌翻译

Statistical Properties of the Entropy from Ordinal Patterns

Eduarda T. C. Chagas , Alejandro. C. Frery , Juliana Gambini , Magdalena M. Lucini , Heitor S. Ramos , Andrea A. Rey

分类：机器学习

2022-09-15

序数模式的统计分析的最终目的是表征它们诱导的特征的分布。特别是，了解大类时间序列模型的对熵统计复杂性的联合分布将允许迄今无法获得的统计测试。在这个方向上工作，我们表征了Shannon经验的渐进分布，用于任何模型，在此模型中，真正的归一化熵既不为零也不为零。我们从中心极限定理（假设大时间序列），多元增量方法和其平均值的三阶校正获得了渐近分布。我们讨论了其他结果（精确，一阶和二阶校正）有关其准确性和数值稳定性的适用性。在建立有关香农熵的测试统计数据的一般框架内，我们提出了双边测试，该测试验证是否有足够的证据拒绝以下假设，即两个信号产生了具有相同Shannon熵的顺序模式。我们将此双边测试应用于来自三个城市（都柏林，爱丁堡和迈阿密）的每日最高温度时间序列，并获得了明智的结果。

translated by 谷歌翻译

Trigger Warnings: Bootstrapping a Violence Detector for FanFiction

Magdalena Wolska , Christopher Schröder , Ole Borchardt , Benno Stein , Martin Potthast

分类：自然语言处理

2022-09-09

我们在新定义的触发警告分配的计算任务上介绍了第一个数据集和评估结果。标记的语料库数据是根据我们自己的档案（AO3）（一个著名的幻想网站）托管的叙事作品编制的。在本文中，我们专注于最常见的触发类型（暴力），并定义文档级二进制分类任务，即是否将暴力触发警告分配给幻想小说，并利用AO3作者提供的警告标签。通过对Corpora进行了四个评估设置培训的SVM和BERT模型，我们编制的汇编$ f_1 $结果范围从0.585到0.798，证明暴力触发警告任务是可行的，这是一项不平凡的任务。

translated by 谷歌翻译

Chosen methods of improving object recognition of small objects with weak recognizable features

Magdalena Stachoń , Marcin Pietroń

分类：计算机视觉 | 人工智能

2022-08-29

许多对象检测模型在小物体检测的几个有问题的方面努力，包括样本数量少，缺乏多样性和低特征表示。考虑到甘斯属于生成模型类，其最初的目标是学会模仿任何数据分布。使用适当的GAN模型将增强低精度数据，从而增加其数量和多样性。该解决方案可能会导致改进的对象检测结果。此外，将基于GAN的架构纳入深度学习模型可以提高小物体识别的准确性。在这项工作中，提出了基于GAN的方法，以改善VOC Pascal数据集上的小物体检测。将该方法与不同流行的增强策略（例如对象旋转，换档等）进行比较。实验基于QuasterRCNN模型。

translated by 谷歌翻译

Ensemble uncertainty as a criterion for dataset expansion in distinct bone segmentation from upper-body CT images

Eva Schnider , Antal Huck , Mireille Toranelli , Georg Rauter , Azhar Zam , Magdalena Müller-Gerbl , Philippe Cattin

分类：计算机视觉

2022-08-19

目的：单个骨骼的本地化和细分是许多计划和导航应用程序中重要的预处理步骤。但是，如果手动完成，这是一项耗时和重复的任务。这不仅对于临床实践，而且对于获取培训数据都是正确的。因此，我们不仅提出了一种端到端学习的算法，该算法能够在上身CT中分割125个不同的骨骼，而且还提供了基于合奏的不确定性度量，有助于单张扫描以扩大训练数据集。方法我们使用受3D-UNET和完全监督培训启发的神经网络体系结构创建全自动的端到端学习细分。使用合奏和推理时间扩展改进结果。我们研究了合奏 - 不确定性与未标记的扫描的前瞻性用途，这是培训数据集的一部分。结果：我们的方法在16个上体CT扫描的内部数据集上进行评估，每个维度的分辨率为\ si {2} {\ milli \ meter}。考虑到我们标签集中的所有125个骨头，我们最成功的合奏中位数骰子得分系数为0.83。我们发现扫描的集合不确定性与其对扩大训练集中获得的准确性的前瞻性影响之间缺乏相关性。同时，我们表明集成不确定性与初始自动分割后需要手动校正的体素数量相关，从而最大程度地降低了最终确定新的地面真实分段所需的时间。结论：结合结合，集合不确定性低的扫描需要更少的注释时间，同时产生类似的未来DSC改进。因此，它们是扩大从CT扫描的上身不同骨分割的训练集的理想候选者。 }

translated by 谷歌翻译

SOLBP: Second-Order Loopy Belief Propagation for Inference in Uncertain Bayesian Networks

Conrad D. Hougen , Lance M. Kaplan , Magdalena Ivanovska , Federico Cerutti , Kumar Vijay Mishra , Alfred O. Hero III

分类：人工智能 | 机器学习 | (统计)机器学习

2022-08-16

在二阶不确定的贝叶斯网络中，条件概率仅在分布中已知，即概率上的概率。Delta方法已应用于扩展精确的一阶推理方法，以通过从贝叶斯网络得出的总和产物网络传播均值和方差，从而表征了认知不确定性或模型本身的不确定性。另外，已经证明了Polytrees的二阶信仰传播，但没有针对一般的定向无环形结构。在这项工作中，我们将循环信念传播扩展到二阶贝叶斯网络的设置，从而产生二阶循环信念传播（SOLBP）。对于二阶贝叶斯网络，SOLBP生成了与Sum-Propoduct网络生成的网络一致的推论，同时更加有效且可扩展。

translated by 谷歌翻译

GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion

Magdalena Proszewska , Grzegorz Beringer , Daniel Sáez-Trigueros , Thomas Merritt , Abdelhamid Ezzerg , Roberto Barra-Chicote

分类：自然语言处理 | 机器学习

2022-07-04

在本文中，我们提出了GLOWVC：一种基于多语言的多语言流程模型，用于与语言无关的语音转换。我们建立在Glow-TTS上，该架构提供了一个架构，该体系结构可以在训练过程中使用语言特征，而无需将其用于VC推理。我们考虑了我们的模型的两个版本：glowVC条件和glowVC阐释。 GLOWVC条件模拟具有扬声器条件流的旋光图的分布，并将Mel-Spectrogragron空间置于内容和音高相关的尺寸中，而GlowVC-Plapic-Plapic-Plocific-Plocific opplicit over opplicit of the SughtliciT模型，无条件的流量和删除空间表示空间 - 内容 - 音调和与扬声器相关的维度。我们根据可见语言和看不见的语言的内部和跨语性转换来评估我们的模型，说话者的相似性和自然性。 GlowVC在清晰度方面的模型大大优于AutoVC基线，同时在语言内VC中获得了高扬声器的相似性，并且在跨语言环境中稍差。此外，我们证明了glowvc-suplicic在自然性方面超过了glowvc条件和自动vc。

translated by 谷歌翻译

Biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data

Magdalena Wysocka , Oskar Wysocki , Marie Zufferey , Dónal Landers , André Freitas

分类：人工智能 | 机器学习

2022-07-02

在本文中，我们提供了针对深度学习（DL）模型的结构化文献分析，该模型用于支持癌症生物学的推论，并特别强调了多词分析。这项工作着重于现有模型如何通过先验知识，生物学合理性和解释性，生物医学领域的基本特性来解决更好的对话。我们讨论了DL模型的最新进化拱门沿整合先前的生物关系和网络知识的方向，以支持更好的概括（例如途径或蛋白质 - 蛋白质相互作用网络）和解释性。这代表了向模型的基本功能转变，该模型可以整合机械和统计推断方面。我们讨论了在此类模型中整合域先验知识的代表性方法。该论文还为解释性和解释性的当代方法提供了关键的看法。该分析指向编码先验知识和改善解释性之间的融合方向。

translated by 谷歌翻译

Rapid training of quantum recurrent neural network

Michał Siemaszko , Thomas McDermott , Adam Buraczewski , Bertrand Le Saux , Magdalena Stobińska

分类：机器学习

2022-07-01

时间序列预测是许多人类活动的关键任务，例如天气预报或预测股价。解决此问题的一种解决方案是使用复发性神经网络（RNN）。尽管它们可以产生准确的预测，但他们的学习过程缓慢而复杂。在这里，我们提出了一个量子复发的神经网络（QRNN）来解决这些障碍。网络的设计基于连续变量的量子计算范式。我们证明网络能够学习一些类型的时间数据的时间依赖性。我们的数值模拟表明，与经典网络相比，QRNN收敛到最佳权重。此外，对于少数可训练的参数，它可以实现比后者更低的损失。

translated by 谷歌翻译